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Abstract 

We propose an approach based on analysis of cost-quality tradeoffs for com- 
parison of efficiency of various algorithms for small-world network construction. A 
number of both known in the literature and original algorithms for complex small- 
world networks construction are shortly reviewed and compared. The networks 
constructed on the basis of these algorithms have basic structure of ID regular lat- 
tice with additional shortcuts providing the small-world properties. It is shown that 
networks proposed in this work have the best cost-quality ratio in the considered 
class. 

1 Introduction 

There exists a number of important cases of real world networks which can be mod- 
eled by a regular lattice with additional long-range connections between nodes (shortcuts). 
While the local connectivity structure for such a networks remains similar to the original 
underlying lattice, the shortest path between any two distant lattice nodes becomes much 
smaller so that the resulting network possesses the small-world property with at most log- 
arithmic increase of the mean distance between nodes with growth of the system size (see, 
e.g. |T] and refs therein). Examples of such networks includes linear polymer networks, 
transportation networks, in vitro neuronal networks (neuronal cultures), lattice models for 
space-time with wormholes, mobile and wireless networks and interconnection networks 
for supercomputers. For definiteness, in this work we have in mind as a principal use 
case designing an optimal interconnection networks for future generation supercomputers 
though the results obtained can be applied to other real networks of the type considered. 

After the Petaflop (10 15 FLOPS) performance barrier for supercomputers was bro- 
ken, the high-performance computing community is exploring development of the prin- 
ciples for creation of next generation systems with performance of the order of Exaflop 
(10 18 FLOPS) [2]. Though first Exaflop systems are estimated to be built between 2018 
and 2020, approaches and basic principles of their construction are currently being ac- 
tively investigated because on a way to their creation it is necessary to overcome a number 
of complex scientific and technical challenges and to elaborate essentially new approaches 
to development of their architecture and the hardware implementation. 



One of the most important components of any supercomputer is the interconnection 
network which primarily defines ability to increase the number of computing nodes that is 
necessary for achieving the desirable performance. According to very modest estimations 
(see, for example, [3]), interconnection networks of Exaflop supercomputers must have 
more than 100 000 endpoints/CPUs (each most likely with many cores). Thus, one of 
key problems which should be solved on a way to next generation supercomputers is the 
development of interconnection network architecture with good scalability and ability to 
provide communication between huge number of computing nodes. 

Three basic aspects of any interconnection network which mostly define its function- 
ality are: 

• network topology, 

• flow control, and 

• routing algorithm. 

In this work we discuss mainly network topology. Two other aspects are quite impor- 
tant but out of the scope of this paper. A correct choice of network topology is vital for 
network design because both flow control and routing algorithms are heavily based on the 
topological properties of the network under consideration. Moreover, we shall consider 
only so called direct interconnection networks in which every node serves both as a pro- 
cessor unit and as a router. Indirect networks have their own merits for relatively small 
networks but are not well scalable. Among direct networks the interconnection networks 
with lattice structure and D-dimensional torus topology are widely used. In supercom- 
puting literature such networks are termed "/c-ary n-cube" [I] (n is the torus dimension, 
in our notations n = D\ k is the number of nodes along each dimension). These networks 
possess a number of advantages, in particular: 

• high resilience of the networks as a whole in case of failure of some nodes (existence 
of bypass communication paths), 

• hypercube of dimensionality which is enough for a given number of nodes provides 
small average distance between nodes, 

• in case of simulation of D-dimensional objects, the structure of the computing tasks 
is mapped to the D-dimensional lattice of computing nodes in an optimal way; it is 
worth mentioning that just tasks of this kind (especially for D — 3) are supposed 
to constitute a considerable part of tasks for the next generation supercomputers. 

However in the case of huge number of nodes typical for next generation supercom- 
puters, the D-dimensional lattice toruses have also essential shortcomings. In particular, 
lattices of low dimensionality have very large average path length between nodes while 
lattices of high dimensionality, comparable with logarithm of number of nodes, are incon- 
venient for embedding into real physical space and therefore for implementation because 
of huge length of communication links. On the other hand, it is well known that intercon- 
nection networks with minimal average distance for a given number of nodes have the best 
operation characteristics, such as, e.g., performance and robustness (see, for example, [5]). 
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Figure 1: A simplified form of the lattice networks supplemented with shortcuts in one- 
dimensional (a) and two-dimensional (b) cases 

Thus usual networks with simple structure of the regular lattices will be ineffective for 
solution of tasks of general type except those related to triangulation of D- dimensional 
objects. 

For that matter it seems attractive to use for the next generation supercomputers 
interconnection networks with small-world property. The point is that while for the 
regular D- dimensional lattice the average distance d between nodes grows as the power 
of number of nodes: d ~ N 1 ^ , in the case of a network with the small- world properties d 
grows essentially slower: d ~ In N. It is worth mentioning that some complex networks 
have average distance that depends on the number of nodes in the power-like manner 
similarly to the regular lattices but with the exponent which is significantly smaller: 
d ~ iV 7 , 7 <C l/D. Quite often investigation of network properties are carried out by 
means of numerical simulations and it is not easy to distinguish such power-like behavior 
from logarithmic one. From the applications point of view and in the case of small enough 
exponent the difference is unessential too. Therefore we will consider that such networks 
also possess the small-world property. 

In the classical version [5] complex networks with small-world properties appears at 
the intermediate stage in the process of stochastic transformation of a regular lattice into 
Erdos-Renyi random graphs [7] , [8] , the structure of the regular lattice is being broken. As 
it was mentioned above, this is undesirable for the interconnection networks. Therefore 
in this work we shall use the modification of the small-world networks construction in 
which the original lattice is preserved and is supplemented by additional links (shortcuts) 
providing the small- world properties (see p], sect. IV. C. 2 and refs. therein). A simplified 
form of such networks in one- and two-dimensional cases is depicted in fig. [T} Below for 
brevity we shall use for such networks the term "lattice networks with shortcuts" (LNS). 

It is necessary to note that besides the small average distance between nodes one more 
distinctive property of the small- world networks is high degree of clusterization 
High clusterization provides local fault tolerance of the networks because of existence of 
local bypass paths in the case of a node failure. Note, however, that in the case of LNS 
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with D > 1 such a local fault tolerance (existence of local bypass paths) is provided by 
the underlying lattice. 

Path length (distance) between nodes is understood in the network sense: as the 
minimum number of edges which it is necessary to pass to get from one node to another. 
Respectively the average distance between nodes is defined as an average over all pairs of 
nodes for a given network. However for huge interconnection networks such an average 
distance may prove to be inadequate characteristic. The point is that for finding the 
shortest routes it is necessary to know global structure of the network (positions of all 
the shortcuts). Therefore the routing of messages using the shortest paths can be too 
cumbersome and ineffective because it requires storing and processing large data amounts. 
Thus routing algorithms based on local navigation [9], [10] become of special importance. 

In case of local navigation the node "knows" only a geographical position (position 
in the underlying lattice) of all nodes and the nearest neighbors with the account of 
shortcuts. Using only this information (information about all the shortcuts is not used) it 
is necessary to deliver a message to the destination node along shortest possible path. The 
simplest solution of this problem is to use the greedy algorithm: the current node sends 
the message to that of its neighbors which geographically (in the sense of coordinates on 
the underlying lattice) is closest to the destination node. Later we will consider a more 
complicated modification for the local navigation algorithm. 

In this work we investigate the average global and local navigation paths between nodes 
as the most important specifications defining communication properties of networks. The 
main goal is to develop an optimal algorithm for construction of a network with large 
number of nodes but small average global and/or local navigation path lengths between 
nodes. A general idea is to add to a regular lattice shortcuts according to an algorithm (or 
algorithms) in such a way that "cost-quality" ratio be optimized for the network obtained. 
Under the character of "cost" we choose unit wiring cost, i.e., total length of all shortcuts 
divided by the number of nodes in the network (total shortcuts length per node), while 
"quality" is the global or navigation path length between nodes. 

As it was already mentioned, the original algorithm [H] for small-world network con- 
struction is stochastic one: at each step of the algorithm the graph edges randomly change 
their positions with a given probability. As result of such an algorithm there appears an 
ensemble of graphs with some distribution of their characteristic properties, in particu- 
lar, with some distribution of average global and local navigation path lengths between 
nodes. For many real networks the stochasticity is an inherent property (e.g., this is true 
for Web and Internet; other examples see, e.g., in [S]). However in the case of interconnec- 
tion networks (as well as for other transportation networks, e.g. for airline networks) the 
design/construction process is under full control of a developer. Therefore the stochas- 
ticity is not inherent for this process. Thus an important question is the following: is 
there such a deterministic algorithm for lattice modification turning it into a small-world 
network that the cost-quality ratio be better than in the case of stochastic algorithms. 
Actually obtaining an answer to this question is one of the principal goals of this work. 

Though some results are valid for lattices of any dimension, in this work we con- 
sider the simplest case of one-dimensional lattices with torus topology (see fig. [l|a)). 
The main reason for this is a convenience of optimization of various algorithms and of 
comparison of networks properties because of shorter simulation time. Generalization to 
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high- dimensional lattices will be considered in subsequent papers. 

This paper is organized as follows. In the next section [2] a short review of both known 
in the literature and original algorithms of LNS construction is presented. In section [3] 
general properties of global average distance of the obtained LSP are considered while 
section [4] is devoted to consideration of average navigation path length. In section [5] we 
present a method for selection of best lattice network with shortcuts (LNS) on the basis 
of cost-quality ratio optimization. The section [6] contains a conclusion. 

2 Small world on a lattice: construction and main 
properties 

2.1 Algorithms for shortcuts establishing 

2.1.1 Stochastic algorithms 

The stochastic algorithm of small-world network construction preserving the underly- 
ing lattice was suggested in a number of works, in particular, [TT], [12], [13] (see also the 
review p] and refs. therein). In one-dimensional case it is formulated below. 

Algorithm SI 

1. The starting network is a one- dimensional lattice with L nodes and topology of a 
circle; 

2. one goes sequentially through all the network nodes and attach to each node % the 
first end of a shortcut with probability < p < 1; 

3. the second shortcut end (that is the node j to which this end is attached) cannot 
coincide with the neighbors of the first end in the sense of the underlying lattice 
and cannot result in duplication of an existent shortcut; in all other respects it is 
chosen randomly with probability 

P(r) ~ r ~ a , (1) 

which is the power-like function of the lattice distance r = between the nodes i 
and j. 

An ensemble of LNSs constructed by means of the algorithm SI is parametrized by 
L, p and a. With the fixed network size L, the parameters p and a should be optimized 
to achieve the best cost-quality ratio. A number of the shortcuts in the ensemble is 
distributed according to the binomial distribution (a number of successful "tests" in the 
L trials) so that the average number of the shortcuts is pL. 

In the search for the possible interconnection network structure the following modifi- 
cation of the algorithm SI may be useful. 
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Algorithm Sim 

1. The starting network is a one- dimensional lattice with L nodes and topology of a 
circle (as in the case of SI); 

2. a number of shortcuts t which have to be added to the lattice is fixed; 

3. out of all the L lattice nodes one randomly chooses t nodes and attaches to them 
first ends of shortcuts; 

4. the second end of each shortcut is chosen randomly and identical to the step [3] of 
the algorithm SI. 

An ensemble of LNSs constructed by means of the algorithm Sim is parametrized 
by L, t and a. The distinction from the previous basic algorithm is that in the case 
of SI both positions and the number of shortcuts are random entities while in the case 
of Sim only positions of the shortcuts are chosen randomly. In other words, the degree 
of stochasticity of the algorithm Sim becomes lower because of the contraction of the 
probabilistic sample space (the set of actual LNS instances). On the other hand, sum of 
the sample spaces for all allowable values of t in the case of Sim is equivalent to the sum 
of the sample spaces for all allowable values of p in the case of SI. Therefore the search 
for an optimal LNS created according to the algorithms SI or Sim have to lead to the 
same results under condition of large enough representative sampling. 

Since our principal use case concerns optimal large interconnection network for super- 
computers (which intrinsically is not stochastic) in our case the algorithm Sim is more 
preferable than SI. 

The next algorithm is the more essential modification of the algorithms SI and Sim 
and further reduces the degree of stochasticity. 



Algorithm S2 

1. The starting network is a one-dimensional lattice with L nodes and topolgy of a 
circle (as in the case of SI and Sim); 

2. a number of shortcuts t which have to be added to the lattice is fixed (similarly to 
Sim) and a fraction c < t of the shortcuts which should be added in special way is 
fixed; 

3. out of all the L lattice nodes one randomly chooses (t — c) nodes and attaches to 
them first ends of shortcuts; 

4. the second end of each of the (t — c) shortcuts is chosen randomly and identical to 
the step [3] of the algorithm SI; 

5. c shortcuts are added one by one and at each step: 

• one updates the list of all nodes to which shortcuts have been already added 
(taking into account both the shortcuts added at the previous steps [2]-|4] and 
at the current step of the algorithm); 
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• from the list obtained one node is randomly chosen and a new shortcut is added 
to it; 

• the second end of each of this additional c shortcuts is chosen randomly and 
identical to the step [I] (any new shortcut should not coincide with any of 
previously established or with an underlying lattice edge). 

An ensemble of LNSs constructed by means of the algorithm S2 is parametrized by 
L, t, c, a. In the algorithm S2 not only the number of shortcuts is fixed but also their 
positions are not quite random but rather subjected to an additional rule the degree 
of this regularity being determined by the parameter c (at c = the algorithm S2 is 
equivalent to Sim). In a sense S2 is a hybrid algorithm, i.e., partially stochastic and 
partially deterministic. 

The simulation shows that the average distance between nodes in case of S2 is smaller 
than for Sl/Slm for wide range of the parameter c. 

Notice that there possible other modifications of the stochastic algorithms. For ex- 
ample, in [H], [15] an algorithm with fixed total length of shortcuts is suggested. Such a 
version may be adequate to the case of working out of transport networks under condi- 
tion that only limited resources (e.g., financial) are available for real implementation of 
the shortcuts. However, in the use cases similar to the interconnection network design- 
ing a more adequate approach, in our opinion, is the multicriteria optimization of the 



cost-quality ratio which will be discussed in section 5.1 



2.1.2 Deterministic algorithms 

A most natural attempt of LNS construction with the small-world properties by means 
of a deterministic algorithm is based on the simple idea: at each step of the algorithm a 
shortcut joins two most distant (in the sense of the existent by this step network) nodes. 



Algorithm Dl 

1. The starting network is a one- dimensional lattice with L nodes and topology of a 
circle; 

2. a number of shortcuts t which have to be added to the lattice is fixed; 

3. one goes sequentially through all the pairs of the network nodes and chooses the 
ones separated by maximal network distance (i.e., with the account of shortcuts 
already added); 

4. from the set selected at the preceding step one chooses a pair with shortest lattice 
distance and joins it by a shortcut; 

5. the steps 3 and 4 are repeated t times. 

A version of this algorithm was discussed in [16]. Besides the lattice size L, this algorithm 
contains the only parameter t, which can be used for the cost-quality optimization. 

The next deterministic algorithm was suggested in |l7j . [T%] and named there by 
"hierarchical HN4 algorithm". 
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Algorithm D2 (HN4) 

1. The starting network is a one-dimensional lattice with L = 2 k nodes and topology 
of a circle; 

2. the nodes are labeled by integers n (0 < n < L) which, in turn, are parametrized 
by two integers {i, j}, < i < k as follows: 

n = T(2j + l); (2) 

3. for any given % (0 < i < k) defining a level of the hierarchy the closest (in the sense 
of the underlying lattice) neighbors are connected by shortcuts (i.e., a node with 
the number 2*(2j + l){mod L} is connected with the nodes 2*[2(j — 1) + l]{mod L} 
and 2 i [2(j + 1) + l]{mod L}). 

Following the general idea of the D2-algorithm one can construct LNSs in which the the 
hierarchy levels are defined by powers of other (not of 2) integers. 

The LNSs, constructed by the D2-algorithm, are reminiscent of the well-known circu- 
lant graphs (see, e.g., the review [TH] and refs. therein). Some subclass of the circulant 
graphs, namely the multiplicative circulant graphs, also have the small-world properties. 
Such graphs are denoted as follows: C(s k ; 1, s, s 2 , . . . , s k ~ 1 ), where s h = L is the network 
size, the unity corresponds to existence of the underlying lattice and s, s 2 , . . . , s fc_1 are 
lengths of the shortcuts. In particular, for diameter of these graphs one has ([19] and refs. 
therein) : 

V = k[s/2\ for odd s>l, 

(3) 

V = ks/2-[k/2\ for even s>l. 

Here [^J is the largest integer not greater than x. 

However the circulant networks have large total wiring cost per node which is equal 
(s h — l)/(s — 1). The next deterministic algorithm, suggested in [20] and pT] . uses the 
circulant networks with short-range shortcuts as starting ones and then supplement them 
with a long-range shortcuts which provide the small-world properties. Since detailed 
formulation of this algorithm is rather lengthy we present here only its basic ideas. The 
details can be found in the above cited works. 

Algorithm D3 

1. The starting network is a one- dimensional lattice with L nodes and topology of a 
circle or the circulant graph C(L; 1, . . . , K/2) (the usual lattice is equivalent to the 
circulant graph with K = 2); 

2. one chooses a subset of nodes from the starting network (the nodes from this subset 
are called hubs); 
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3. the hubs are connected by edges (shortcuts) so that this results in some graph H, 
whose nodes are the hubs and edges are the added shortcuts; 

4. in addition, one carries out a special local reorganization of the edges so that degrees 
of all the nodes be the same. 

On the basis of the ideas of the D2-algorith and the general structure of the multi- 
plicative circulant graphs, in this work we suggest a new algorithm providing logarithmic 
dependence of the average distance on the network size. Similarly to the multiplicative 
circulant graphs, the shortcuts of the lengths b,b 2 , . . . ,b k ( for some b, k G Z) are added. 
However, the number of the shortcuts decreases with their length so that unit wiring cost 
grows proportionally to k but not to b k . At first, we formulate a simplified version of such 
an algorithm. 



Algorithm D4s 

1. The starting network is a one-dimensional lattice with L = mb k (m G Z nodes and 
topology of a circle; 

2. starting from an arbitrary node all the network nodes are labeled by numbers from 
to L - 1; 

3. the nodes 0, b, 2b, . . . , L — b are connected by shortcuts in a cycle (the node is 
connected with b, b with 2b, etc.; L — b is connected with 0); 

4. for each i = 2, . . . , k the nodes 0, b % , 26*, . . . , L — b % are connected in a cycle. 

The networks constructed by this algorithm has the following structure (see fig. [2^a)): 
the end points of each of m shortcuts of the length b k are connected by a chain of b 
shortcuts of the length 6 fc_1 , the endpoints of the latters are connected by a chain of b 
shortcuts of the length b k ~ 2 , etc. It is easy to show that even for the simplest greedy 
navigation algorithm the maximal path length between any two nodes and therefore the 
average distance have the order 0((kb + m)). This implies that for fixed b and m = 
L/b k = 0(1) the average distance proved to be O (logL). More detailed estimations are 



presented in section 3.2 



We will call the networks constructed by this algorithm subcirculant ones. The short- 
coming of the subcirculant networks constructed by the algorithm D4s is the low robust- 
ness with respect to possible faults at the nodes 0, b k , . . . , {m — l)b k : since there are no 
bypass paths around these nodes a fault of any of them may considerably increase load 
(betweenness centrality) for other nodes. Moreover removal of a couple of these nodes 
makes the network a disconnected one. Besides the nodes of the highest level (i.e., the 
nodes with longest shortcuts) have high betweenness centrality and in the case of large 
network size plus relatively small b they have high node degree. These properties may 
cause problems in case of real implementations. 

In the following modification of the algorithm D4s the mentioned problems are solved 
by parting the cycles of the different levels to the neighboring (in the sense of the un- 
derlying lattice) nodes. In addition, the modified algorithm is generalized to the case of 
arbitrary network size L (not necessarily divisible by b). 
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Figure 2: LNSs constructed by the algorithm D4s (a) and by D4 (b); in both cases 
L — 81, 6 = 3, k = 3; the node labeled by the number "0" is depicted by empty circle 

Algorithm D4 

1. The starting network is a one- dimensional lattice with L nodes and topology of a 
circle; 

2. positive integers b, k satisfying the conditions 



are chosen; 

3. starting from an arbitrary node all the network nodes are labeled by numbers from 
to L - 1; 

4. k — 1 cycles are created in the following way: for i — 2, . . . , k — 1, k, the nodes 
i, i + b\ i + 26*, . . . , i + hib % where hi = [(L — i)/b % \ are consecutively connected 
by shortcuts in a cycle; 

5. one more cycle is created: 

• for j = 0, . . . ,hi where h\ = \_(L — l)/b\ one selects the nodes 1 + jb and 
then for each node chooses the nearest in order of the labeling node qj which 
does not have so far a shortcut; in such way on obtains a set of nodes C[ = 

• if this set contains neighbors in the sense of the underlying lattice, i.e. 
3 {Qj, Qj+i} '■ Qj+i ~ Qj — 1 then the node qj is removed from the set; the 
resulting set is denoted C\\ 



2<b<L/2 , 
b k + k < L , 
k<b 2 ; 



(4) 
(5) 
(6) 
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• the nodes from the set C\ are consecutively connected by shortcuts in a cycle. 

The D4 LNS structure provides that the local navigation algorithm comparing the 
lattice distance to a destination node for all neighbors of neighbors of a source node 
proves to be comparable in its effectiveness with the algorithm using globally shortest 
paths. In more details the properties of the D4 LNSs are discussed in sections [3] and |4j 

Notice that the condition ^ is not absolutely necessary and is used here for simplicity: 
if k > b 2 one has to varies lengths of some shortcuts in more than one cycle in order to 
consecutively put k nodes entering k different cycles. 

A sample of the graphs constructed by the algorithm D4 for L = 81, 6 = 3, A; = 3 is 
depicted in fig. |2](b). Of course, the samples in this figure are merely illustrative: actually 
we are interested in networks of essentially larger size. In particular, in the following 
sections we will present results for such networks with L > 10 4 . 



3 Average distance between nodes 

3.1 Average distance for stochastic LNS 

With the use of the stochastic algorithms the average path length d between nodes is 
a result of the double averaging: 

• over the statistical ensemble of random graphs, obtained as a result of the the 
stochastic process of shortcuts establishing; this averaging is denoted by angular 
brackets: (•); 

• over all the pairs of nodes for each instance of the network: 

The typical dependence of the average distance between nodes for LNSs, created by 
the algorithm Sim, on the parameters a and t is presented in fig. [3j The results are 
obtained by numerical simulation for an ensemble of 1000 instances with L = 10 4 nodes. 
For a > 2 the average shortcut length becomes too small and they do not influence the 
network properties so that the small- world properties disappear [TT], [T5] . 

From naive consideration of the results depicted in fig. [3] (for the algorithm SI and 
the parameter p the dependence is quite similar) one may conclude that to construct a 
network with the best value of the average distance between nodes it is enough to choose 
the largest possible value of t (or p) and the smallest value of a. However, as it was stressed 
in the Introduction, each solution has its cost and one has to optimize the cost-quality 
relation for sought network (otherwise the obvious solution is just the complete graph in 
which all the nodes are connected by direct links). Since in this work the total shortcuts 
length per node serves as the "cost" one has to analyze dependence of the mean distance 
on this quantity. Typical results of the numerical simulation for such a dependence are 
depicted in fig. |4j 

In fig. [4] every point with the respective value in gray scale corresponds to an instance 
of LNS in ensembles with L = 10 4 nodes, a = 0, 0.2, 0.4, 0.6, 0.8, 1.0 (the more the value 
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Figure 3: The dependence of the average path length on LNS's parameters 



d 

400- 




Figure 4: Dependence of the distances between nodes averaged over instances of the 
network on the total shortcuts length per node 
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Figure 5: Dependence of the average (over ensemble and instances) distance on the pa- 
rameter c (the number of forcibly conjoined shortcuts) in the case of the algorithm S2 for 
t = 300, a = 0, 0.4, 0.8, 1.2 and L = 10 4 

of a the lighter the shading) and with t = 100, 300, 1000; the vertical axis corresponds 
to the average (over instance) distance between nodes while the values on the horizontal 
axis shows actual (numerically calculated) total shortcuts length per node C/L [C is the 
total shortcuts length, L is the number of nodes). The logarithmic scale is used for both 
axis. In the case of the algorithm SI, a similar results were obtained but with wider 
distribution (as expected). These results imply that for decreasing the average distance 
at equal cost one have to choose an ensemble with maximally possible value of t (or p) 
and as well with maximally possible for the given cost value of a. In other words, large 
number of not too lengthy shortcuts is a cheaper solution. 

The results depicted in figj4] and as well similar results obtained for other values of 
the parameters give a more detailed representation of the dependence of the average 
distance on the cost than the results of the work [13]. In the latter the results for distance 
averaged both over instance and over ensemble is presented. Besides the "theoretical" 
total shortcut length CV /L = ps where s is the shortcut length averaged with respect to 
the distribution (jl]is used as the wiring cost. In our work we use real total shortcut length 
numerically calculated for a given instance of the networks. The presented detailed data 
shows that direct using cost-quality relation does not provide unequivocal choosing an 
optimal network (i.e., in the case of algorithms SI and Sim does not provide choosing the 
optimal values for p, t and a). Therefore one has to use methods of multicriteria analysis, 
see section [5] 

Numerical simulations show that average distances in LNSs, created by the algorithm 
S2, are smaller than in the case of Sl/Slm in wide range of the parameter c (the number 
of forcibly conjoined shortcuts), see fig. [HI. 

Thus from the point of view of the global average path length, S2 proves to be preferred 
among all the stochastic algorithms. For S2 one can also find dependence of the average 
distance on total shortcut length per node (unit wiring cost) similarly to fig. El However 
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Figure 6: Dependence of the average distance d on the shortcuts length per node C/L for 
one-dimensional networks with L = 10 4 in case of the deterministic algorithm Dl (black 
dots) and for the stochastic algorithm Sim (a = 0, 1, 2, the darker the the dots the 
larger the values of a); the number of shortcuts in Dl does not exceed L 

because of bigger number of parameters such a graphic presentation appears to be not 
quite instructive. Generally speaking, at the accounting of the wiring cost the algorithm 
S2 might be non-optimal even in the class of stochastic algorithms. However in the section 



5.1 we shall show with the help of the appropriate method that S2 indeed most preferable 



stochastic algorithm. 

3.2 Average distance for deterministic LNSs 

The deterministic algorithms produce a unique instance of the network. Therefore 
the average distance between nodes is defined similarly to ^ but without averaging over 
ensembles: 

i,3 

LNS constructed by the algorithm Dl. The behavior of the average distance be- 
tween nodes of LNS, constructed by the algorithm Dl, was investigated in [16] by means 
of numerical simulations. It was shown that with grows of the number of shortcuts t 
the average distance decreases as a power low. Notice that a simple consideration of the 
quality-cost dependence allows in some cases to make a choice among different types of 
the algorithms. In particular, the dependence of the average distance on the shortcuts 
length per node for deterministic algorithm Dl and for stochastic one Sim is presented 
in fig. U 

The results in fig. [6] shows that the cost-quality ratio for the algorithm Dl is worse 
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Table 1: Characteristic properties of LNSs constructed by D2 and D4 



L 


C/L for D2 


C/L for D4 


d for D2 


i& for D4 


1024 


8.5 


4.0 


11.4563 


12.4992 


4096 


10.5 


5.0 


17.7199 


17.0229 


16384 


12.5 


5.0 


26.3773 


21.8356 


65536 


14.5 


6.0 


38.0082 


27.0769 


262144 


16.5 


7.0 


53.3045 


32.4278 


524288 


17.5 


8.0 


62.6001 


35.6748 



than that for Sim for large enough a. Thus Sim (or SI) is more preferable than Dl. As 
it is clear from the discussion in the preceding section, this even more so for S2. 

LNS constructed by the algorithm D2. An estimation for the diameter of LNSs 
constructed by the algorithm D2 was obtained in [T7j, [18] : 

V ~ log 2 L) p , p ~ — + - . 9 

log 2 log 2 L 2 2 

Strictly speaking, the exponent (3 00 so that the diameter grows faster than any 
power of log 2 L. However (3 grows very slowly ranging merely from pa 1.44 to /? ~ 1.84 
over nine orders of magnitude of varying L: from L = 10 to L = 10 10 [18] . Therefore in 
practice it is possible to consider that such network possesses property of the small world. 
This is confirmed by results of our numerical simulations presented in the Table [TJ 

The shortcuts length can be easily calculated: indeed, at each level 1, 2, . . . , k — 2 the 
shortcuts goes consequently one-by-one and encircle all the L = 2 k nodes of the underlying 
lattice. At the level k — 1 there is one shortcut of the length L/2. As a result the total 
shortcuts length appears to be Lik — 3/2), and the length per node is C/L = k — 3/2. 

However, the results presented in the table [T] show that the subcirculant D4-networks 
have lower cost and for L > 4000 have lower navigation length £^ than the global average 
distance d in case of the algorithm D2 (the superscript "(2)" in £^ means that the two- 
level navigation algorithm is used, see section|4]). Since the global distance does not exceed 
any navigation path length, D4 is a fortiori more preferred algorithm in comparison with 
D2. 

LNS constructed by the algorithm D3. Estimations for the diameter of networks 
constructed by the algorithm D3 were obtained in [20]. In particular, 

V<2\(\L/h]-l)/K]+D H . (10) 

Here Dq is the diameter of the network with the shortcuts, Dh is the diameter of the 
graph H, h is the number of hubs; \x] is the smallest integer not less than x. As the 
graph H connecting the hubs, two types of graphs were considered: star graphs Sih-i 
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(one node is connected to all h — 1 other nodes) with the diameter 2 and double loop 
circulant graphs C{h; a, b) with the diameter 

D C (h;a, b )= r(-l + V / 2/^T)/2] . 

LNS constructed by the algorithm D4s. Average distance between nodes in LNSs 
for the algorithm D4s does not exceed an average path length constructed by the following 
algorithm: 

• from the initial node i move to the nearest node with the label v±b through which 
passes a cycle made of shortcuts of length b (let us call it the first level cycle); then 
using the first-level shortcuts move to the nearest node v^b 2 entering the second-level 
cycle and continuing in this way reach the node N k (i) = v^b k entering the top-level 
cycle; 

• a similar (though backward) path is created from the destination node j to the node 
N k (j) = v' k b k ; 

• move from N k (i) to N k (j) by shortcuts of the length b k and then by the path 
constructed at the preceding step move to the destination node j. 

Total length of all such paths is equal 

L-l 

2(L-1)£*«(0+ ^'), 

i=0 0<i,j<L 

where l a (i) is length of the "ascending" path from i to N k (i) and k(i,j) is the number of 
the top-level shortcuts between N k (i) and N k (j). For reasons of symmetry one finds that 
the number of nodes for which N k (i) = ub k is equal to b k for each u = 0, 1, . . . , m — 1. 
Therefore 

ra-l Am/21 [m/2\ \ 32fc 

h(i,j) = mb 2k h(0,ub k ) = mb 2k MT u + « < ^f" • 

0<i,j<L u=0 \ u=0 u=l / 

The total length of "ascending" paths reads 




from which on finds that the total length of entire paths does not exceed (L — l)L/c| + ^L 2 



'2 1 4 

h2_ 



for even b and (L — l)Lk^-^ + ^L 2 for odd b. For odd m the last item can be substituted 
with m ^ m l L 2 so that for m = b and odd b the total length of entire paths does not exceed 
(2(L - l)Lk + L 2 ) b -^. Hence the upper bound for the average distance reads 

, bk m L 

a < h — tor eveno, 

2 A L-l 
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. _ (6 2 - l)Jfc m L . , .. 
In particular, for m = b and even 6: 

for m = b and odd b: 

b 2 - 1 / 1 
rf< — — 21og,L-l 



46 \ " L - 1 



LNS constructed by the algorithm D4. Similarly to the case D4s for the average 
distance can be found upper bound by consideration of paths for all pairs of nodes. In 
the case L = mb k , k < b the estimations reads 



bk m L 

a < 1 h 2k — 2 for even b , 

~~ 2 4 L- 1 



(b 2 + l)k m L 

d < - t^ + ^t + 2A;-2 for odd b 

2b 4 L - 1 



(the term 2k — 2 is added because for transition from the cycle of the level % to the cycle 
of the level i + 1 for % — 1, . . . , k — 1 one has to make an additional step along an edge of 
the underlying lattice). 

If the conditions L = mb k and k < b is not fulfilled, lengths of first-level shortcuts and 
the number of steps over shortcuts of a level % necessary for reaching the node i +v i+±b t+1 
which is next to the node i + 1 + v i+ib l+1 entering the i + 1-level cycle, may prove to be 
greater then b. However these differences affect a small part of the nodes. We suppose, 



on the analogy with (Jl 1 p that in this case the following upper bound is correct: 

(12) 



d<\ 



k(b + A) + ±_ 2 



2 Ab k 



where A ~ 1. 

From this estimation it follows that with an appropriate choice of the parameters 
the average distance in the subcirculant networks grows no faster than log L. In par- 
ticular, d < 4(it - 2) + 2 = 21og 2 L - 6 for L = 2 2u , b = 4, k = u - 2. Numerical 
simulations confirm this conclusion: the results for d in D4-networks with b = 4, 5 for 
L = 1000, 2000, 5000, 10000, • • • , 500000 is depicted in fig. [7j the parameter k being 



maximal integer such that b k < L/2. The estimation of the upper bound (12) for A = 1 
is also presented in this figure. It is seen that the average distance indeed logarithmically 
grows with growth of the network size and the estimation obtained correctly reflects this 
dependence. 



As it was mentioned in section 2.1.2, multiplicative circulant graphs also possess the 
small- world properties (cf. A comparison of the circulant graphs and suggested in 

our work subcirculant graphs will be presented in section [5j 
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Figure 7: Average distances d in D4-networks with b = 4, 5 for L = 
1000, 2000, 5000, 10000,- •• , 500000; k is the maximum integer such that b k < L/2; 



the dotted line is the upper bound ( 12 ) at A = 1 



4 Local navigation in LNSs 

As it was noticed in Introduction, from the point of view of optimization of parameters 
of stochastic LNSs the dependence of navigation length on parameters of the algorithms is 
of special interest. Pioneering work in the field of local navigation in spatially embedded 
networks with small-world properties is [10] by Kleynberg while the very formulation of 
the problem goes back to the classical Milgram's experiment [9] . 

In short the navigation problem in LNSs is set as follows: 

• a node "knows" only position of the destination node in the underlying lattice and 
all its neighbors with the account of shortcuts (in more elaborated versions the node 
may "know" its neighbors with the depth more than unity); 

• the problem is to find the shortest way to destination using only this local in- 
formation (without knowing all the global LNS structure, in other words without 
knowledge about positions of all the shortcuts). 

The simplest local navigation follows the greedy algorithm: an entity goes from a node 
to that of its neighbors which geographically (i.e., in the sense of coordinates on the 
underlying lattice) is closest to the destination node. 

The most important characteristic property of the navigation process is the average 
navigation path length between nodes of a network which is defined similarly to ([7]), ([8]): 

2 



I» = 77itiTl>> ' (13) 



L(L-1)^ VJ ' i(i-l) 

where 1^ the navigation length between nodes i and j. Of course for deterministic algo- 
rithms the averaging over ensemble (•) is absent. 
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Figure 8: Dependence of the distribution of the average (over instances) naviga- 
tion path length i on shortcut length at three different values of t and for a = 
0, 0.2, 0.4, 0.6, 0.8, 1.0; the lighter the shading the larger value of a 

4.1 Navigation path length for stochastic algorithms 

The dependence of distributions of the averaged (over instances) navigation path 
lengths i on shortcuts length per node as well as on the parameters t and a in the 
case of the algorithm Sim is shown in fig. [8] For SI the behaviour is similar when t is 
substituted by p. It is seen that there exist a minimum of I at different values of a de- 
pending on t. Predictably the more shortcuts the smaller average navigation path length 
in the minimum. It is seen also that for a given cost one can find the parameters of the 
algorithm Sim with smallest navigation path length and vice versa for given navigation 
path length one can find the parameters with smallest total shortcuts length per node. 
Therefore in this case one has to solve the cost-quality optimization problem too. 

Decrease of the average navigation path length for S2 with growth of the number c of 
forcibly conjoined shortcuts is presented in fig. [9j As it will be shown in section [5j S2 has 
the best cost-quality ratio among all the stochastic algorithms. 

4.2 Navigation path length for hierarchical deterministic algo- 
rithms 

Numerical simulations show that the global average distance between nodes in the 
networks constructed by algorithms D2, D4s and D4 is very small. However such networks 
possess rather large average path length in the case of the local navigation based on the 
elementary greedy algorithm. Accordingly, in this work the modified algorithm of local 
navigation, namely the two-level local navigation, is suggested. In this modification not 
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Figure 10: Dependence of the average path length £^ for the two- level navigation on 
logarithm of the number of nodes; 6 = 4 (lower curve), 5, (middle curve) 6 (upper curve); 
k is chosen to minimize for a given L 



only the nearest neighbors of a given node are taken into account but also neighbors of 
neighbors. Thus in the case of an interconnection network a message from a given node 
is transferred to that its neighbor which in turn has a neighbor closest to the destination 
node in the sense of the underlying lattice. The two-level greedy algorithm requires a 
little bit more calculations at each node but still remains local and well scalable. The 
average path length in case of two-level local navigation is denoted by £^ 2 \ Notice that 
the global average path length d can be considered as a limiting case of navigation length 
with the infinite depth of search. The results of numerical simulations for £^ in LNSs 



constructed by the algorithm D4 is shown in fig. 10 
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5 Selecting the algorithm for LNS construction 



5.1 Multicriteria cost-quality optimization 

Multicriteria (or multiobjective) optimization has been applied in many fields of sci- 
ence where optimal decisions need to be taken in the presence of trade-offs between two 
or more conflicting objectives. In our case we will consider two criteria: average path 
length between nodes (global d or navig ation £, £^) that expresses "quality" of an LNS, 
and shortcuts length per node C/L that reflects "cost" of creation of the LNS. From the 
consideration in the preceding sections it is obvious that these two quantities are interre- 
lated: by increasing the cost C/L it is possible to improve quality (to reduce d, £ or £ < - 2 * 1 ), 
and, conversely, improving the quality causes increasing the cost. 

There exist a number of approaches to multicriteria optimization (see, e.g. [22\). We 
will use the method of weighted sum (in more general context, scalarizing multiobjective 
optimization problems). For this aim we define the following target functions: 

G w = wd+(l-w)C/L , (14) 

G' w = w£ + (l-w)C/L, (15) 

G" w = w£^ + (l-w)C/L. (16) 

Minimization of these target functions means that optimal values for parameters of 
the algorithms are picked out from point of view of quality (small average path length) 
and cost (total shortcuts length per node). With such a minimization the parameter 
< w < 1 characterizes the relative significance of the each of the criteria (cost and 
quality). In other words, the suggested method of optimization implies that for each 
value of the significance parameter w one have to find values of the algorithm parameters 
(e.g., p/t and a for Sl/Slm; t, c and a for S2; b and k for D4; the network size L is 
supposed to be fixed) which minimizes G w , G' w and G" w . 

To illustrate an application of this method for LNS analysis the values of the parame- 



ters t and a which minimize G w are presented in fig, 11 for network ensembles constructed 
by Sim. These results imply that when the quality significance w is greater than 0.35, 
the number of the shortcuts must be maximum possible (equal to the number of nodes) 
while at smaller values of w the importance of shortcuts length minimization leads to 
decreasing the number of shortcuts (values of t). On the contrary, values of a close to 
2 (the critical value for one-dimensional small- world LNS [11]) are optimal ones for w 
smaller than 0.5, that is in the domain of prevailing cost. 

Results of calculation of minimum values of G w , G' w and G" w for the Sim LNSs are 
presented in fig. 12 [L = 10 4 ; averaging over ensembles of 100-1000 instances). Each 



point on the curves corresponds to an ensemble with parameters t, a providing minimum 
G w , G' w and G" w for a given w. It is seen that in the total range of w the function G w 
( "infinite" searching depth) has smallest values, the values of G" w (searching depth is equal 
to 2) is greater than those for G w and the values of G' w (searching depth is equal to 1) 
are the largest ones. This result is intuitively understandable: the more information one 
uses for searching a shortest path the smaller values of the target function. 
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Figure 11: Dependence of optimal values of a (a) and t (b), minimizing G w , on the quality 
significance parameter w (L — 10 4 ) 




Figure 12: Dependence of minimum values of the target functions on w in the case of 
algorithm Sim: the curve 1 — min G w ; the curve 2 — min G" w ; the curve 3 — min G' w 
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Figure 13: Dependence of minimum values of the target functions on w: the curve 1 - 
min G" w for S2; 2 — min G" w for Sim; 3 — min G' w for S2; 4 — min G' w for Sim 

More important that the target functions G w , G' w and G" w make possible to compare 
different algorithms of LNS construction including those with different set of parameters. 

5.2 Simulation results for different algorithms 



Results of the comparison of two stochastic algorithms is presented in fig. 13 They 
show that with the use both of two-level and ordinary (one-level) navigation, S2 is the 
preferred algorithm. 

A comparison of minimal values of the target functions for the algorithms D3, D4 as 



well as for the multiplicative circulant graphs is shown in fig. [14J It is seen that if the 
cost is of the essential significance (w < 0.8), D4 has advantage over D3 and the circulant 
graphs. On the contrary in the case of prevailing quality (w > 0.8), D3 may be more 
effective especially with star-like connection of hubs. This is not surprising because the 
star-like graph has quite small diameter T> = 2. However we have mentioned already that 
the star-like configuration is too fragile since in case of a failure of the central node the 
structure proves to be completely disintegrated. 

Thus D4 is the preferred algorithm in the class of deterministic algorithms in wide 
range of the parameter of quality significance w. A comparison with stochastic algorithms 



remains to carry out. An appropriate data are presented in fig. 15 where minimal values 
of the target functions for S2, D3 (with double loop connection of the hubs), and D4 
are shown. The results show that practically in the whole range of the parameter w the 
deterministic algorithms D3 and D4 are preferred. 

It is worth noticing that the comparison with the stochastic algorithms was carried 
out for the characteristics averaged over ensembles of LNSs. One may wonder if selecting 
instances with good characteristics may result in essential improvement of the cost-quality 
ratio. The numerical simulation shows that in the case of the best stochastic algorithm 



S2 such an improvement is hardly possible, see fig. 16 
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Figure 14: The dashed curve 1 — min G w for the multiplicative circulant graphs; the 
dotted curve 2 — min G w for D3 with star-like connection of hubs; the dashed curve 3 - 
min G w for D3 with double-loop connection of hubs; solid curve 4 — min G" w for D4; 
L = 10 4 




Figure 15: Dependence of minimum values of the target functions on w: the dotted curve 
1 — min G" w for S2; the dashed curve 2 — min G w for D3 with double- loop connection 
of hubs; the solid curve 3 — min G" w for D4; L = 10 4 
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Figure 16: Dependence of minimum values of the target functions G" w on w for the algo- 
rithm S2: solid curve — for characteristics averaged over ensemble; dashed curve — for 
instances from the ensemble having in a sample of 100 instances the fifth in the order of 
increasing averaged path length between nodes (fifth percentile) 



6 Conclusion 

In this work we considered a number of both known in the literature and original 
algorithms for construction of complex networks with the property of a small world, 
namely with slow (logarithmic) growth of average distance between nodes with growth 
of the network size. The networks constructed on the basis of these algorithms have 
basic structure of a regular lattice with additional shortcuts providing the small-world 
properties. We suggested a method for comparison of various algorithms on the basis 
of optimization of the cost-quality ratio. Under the character of "cost" we have chosen 
unit wiring cost, i.e., total length of all shortcuts divided by the number of nodes in the 
network (total shortcuts length per node), while "quality" is the global or navigation 
average path length between nodes. The optimization is carried out for a weighted sum 
in which the relative significance of the cost and quality is defined by a value of an 
appropriate parameter (weight). 

The two main classes of the algorithms considered are stochastic and deterministic 
ones. Therefore, an important question is which type of the algorithms has better cost- 
quality ratio. As a result of the investigations it has been shown that in the case of 
one-dimensional underlying lattice and from the point of view of the cost-quality ratio 
the preferred networks are proved to be the subcirculant ones which are constructed by 
means of the deterministic algorithm (denoted in this work as D4). The subcirculant 
networks have the structure similar to that of the well known multiplicative circulant 
graphs but with essentially smaller number of shortcuts (and, respectively, lower wiring 
cost). 

It necessary to stress that though in this work the principal use case was designing 



25 



an optimal interconnection networks for future generation supercomputers, we believe 
that the results obtained can find applications in many other areas when communication 
properties of some regular network need to be improved without destroying the underlying 
basic structure. 

In the subsequent publications we will continue the consideration of the lattice net- 
works with shortcuts (LNS) both from the point of view of other characteristics (in partic- 
ular, beweeness centrality, fault tolerance, etc.) and generalizations to higher dimensional 
underlying lattices. 

This work is partially supported by RFBR grant No. 12-07-00408-a. 
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